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Abstract 

In this paper we extend the market-making models with inventory constraints of Avellaneda and 
Stoikov {High-frequency trading in a limit-order book, Quantitative Finance Vol.8 No. 3 2008) and 
Gueant, Lohallo and Fernandez- Tapia (Dealing with inventory risk, Preprint 2011) to the case of a 
rather general class of mid-price processes, under either exponential or linear PNL utility functions, 
and we add an inventory-risk-aversion parameter that penalises the marker-maker if she finishes her 
day with a non-zero inventory. This general, non-martingale framework allows a market-maker to 
make directional bets on market trends whilst keeping under control her inventory risk. In order to 
achieve this, the marker-maker places non-symmetric limit orders that favour market orders to hit 
her bid (resp. ask) quotes if she expects that prices will go up (resp. down). 

With this inventory-risk-aversion parameter, the market-maker has not only direct control on her 
inventory risk but she also has indirect control on the moments of her PNL distribution. Therefore, 
this parameter can be seen as a fine-tuning of the marker-maker's risk-reward profile. 

In the case of a mean-reverting mid-price, we show numerically that the invontory-risk-avcrsion 
parameter gives the market-maker enough room to tailor her risk-reward profile, depending on her 
risk budgets in inventory and PNL distribution (especially variance, skewness, kurtosis and VaR). 
For example, when compared to the martingale benchmark, a market can choose to either increase 
her average PNL by more than 15% and carry a huge risk, on inventory and PNL, or either give up 
5% of her benchmark PNL to increase her control on inventory and PNL, as well as increasing her 
Sharpe ratio by a factor bigger than 2. 

Keywords: Quantitative Finance, high-frequency trading, market-making, limit-order book, inventory 
risk, optimisation, stochastic control, Hamilton-Jacobi-Bellman, PNL distribution. 

1 Introduction 
Mcirket- mcikers 

A market-maker is a trader who buys and sells assets in a stock exchange. The difference with any other 
market agent is that the market-maker is bound to make firm quotes: once she shows a buying/selling 
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quantity at a certain price, she is engaged to trade under those conditions. As Hquidity provider, a 
market-maker receives a compensation: she buys at a lower price (bid) and sells at a higher price (ask). 
This difference is called the spread. 

A market maker is exposed to two main risks, adverse selection and inventory risk. Adverse selection 
means that if the market-maker sells (resp. buys) an asset it is not necessarily good news, it could mean 
that her ask (resp. bid) price is lower (resp. higher) than it should on the current market conditions. 
Inventory risk comes into play by inbalances in the arrival of buying and selling orders: since the market- 
maker quotes both bid and ask prices, her net position depends on which quotes are executed and in 
which quantities. A market maker uses the spread to both control her inventory and compensate herself 
from adverse selection. In a nutshell, a market-maker loses money against informed traders, but she 
covers that loss by making noise (i.e. un-informed) traders pay the spread on each transaction. 

In order to create a market-making strategy, we need to consider three factors: price, spread and 
inventory. The price is often the mid-price, i.e. the average between the current ask and bid prices of the 
market. The spread of the market-maker is her only control on her PNL and inventory throughout the 
trading day. It is true that the market maker can affect the mid-price by improving the current ask and 
bid market quotes, but since that would normally trigger a market order that consumes the offer, in a 
first approach we can consider that the mid price cannot be affected by the quotes of the market maker. 

Main features of the present article 

In this article we extend the current stochastic-control models of market-making, in particular those of 
AvcUaneda and Stoikov T] and Lehalle et al j^. 

We managed to find closed-form solutions for the optimal ask/bid quotes of a market-making for mid- 
price dynamics that are not necessarily martingale, which can be interpreted of directional bets on price 
trends. Although our approach is based on optimal stochastic-control and nonlinear-PDE techniques, 
the philosophy is very simple: given the utility function we choose a possible form of the solution of the 
nonlinear PDE equation (i.e. we make an ansatz); we plug this ansatz into the equation and compute 
the (implicit) optimal controls; we then plug the (implicit)controls and separate the equation into several 
simpler ones, normally linear; we then compute explicitly the controls and the solution for the equation. 

Sometimes it is impossible to find explicitly the controls for the solution, but for a sub-solution it 
is always possible (at least for the class of processes and utility functions we are dealing with) . In that 
framework, the controls do not optimise the utility function but give a lower bound of the potential losses. 
It is worth to mention that, for practitioners, it is better to have explicit controls that minimise potential 
losses (a sub-solution) than highly implicit, numerically-intensive controls that optimise the PNL. 

Our approach can be applied to any utility functions, not only exponential as in Avellaneda and 
Stoikov [T] and Lehalle et al [5], provided it is explicit enough to admit an ansatz. This gives a lot of 
flexibility to the market-maker for the choice of her risk-reward profile. Of course, selecting a mid-price 
dynamic determines the class of utility functions that we can choose from because the utility function 
has to be finite (a.s.). However, with the most recurrent mid-price models such as martingales, Brownian 
motion with drift, Ornstein-Uhlenbeck (resp. Black-Scholes) a linear (resp. exponential) utility function 
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is finite. 

We add a new parameter, which models the inventory-risk aversion of the market-maker. As it will 
be shown in the numerical simulations, this parameter guarantees that the trading algorithm will end 
the day with a flat inventory, which is the goal of a market-maker. Moreover, it also allows the market 
maker to control its directional bets via exposure to price movements intraday. 

We show that the inventory-risk-aversion parameter not only exerces direct control on the inventory 
risk directly but it also has some indirect control on the risk in the PNL distribution of the market-maker 
(i.e. on the first four moments namely mean, variance, skewness and kurtosis). Moreover, this relation 
can be also inverted: the parameter of the exponential utility function has direct control on the PNL 
distribution and indirectly controls the inventory risk. This can be interpreted as a high risk - high reward 
scenario: big exposure to extreme events, either via fat tails or directional bets, improves the average 
PNL. 



Organisation of the study 

The goal of this study is to find the optimal ask and bid quotes for a high-frequency market-maker that, 
under the framework of a directional bet on the market trend, simultaneously maximise her PNL and 
minimise her inventory risk. The main inspiration is the paper of Avellaneda and Stoikov [T], who found 
via stochastic control the optimal bid and ask quotes for a high-frequency market-maker. The second 
inspiration comes from Lehalle et al 6 , who formalised the findings of Avellaneda and Stoikov It 
is also worth to mention that the Hamilton- Jacobi-Bellman framework we use was originally set by Ho 
and Stoll 8J, but in neither of these three articles the effect of a directional market bet on the PNL 
distribution was considered. 

In Chapter 2 we set the framework under which we will be working, which is stochastic control and 
Hamilton- Jacobi-Bellman equations. In Section 3 we completely solve the optimal control problem for 
a linear utility function and an arbitrary Markov process by finding explicitly the unique solution. In 
Section 4 we perturbate the linear utility function via a quadratic inventory penalty. By using first-order 
approximations on the arrival of orders to the limit-order book, we find explicitly the (approximate) 
solutions and their corresponding optimal controls, which are perturbations of the closed-form solution 
and controls we found in Section 3. In Section 5 we extend the results of Avellaneda and Stoikov Jl] 
and Lehalle et al [6] for exponential utility functions. We show that their approach can be used to 
more general process than arithmetic Brownian motion and with inventory-risk aversion. In Section 6 we 
perform numerical simulations to show how directional bets on the market trend affect the market-making 
strategies. We also assess the effect of the inventory-risk-aversion parameter on the PNL distribution and 
the inventory risk and show that this parameter controls directly the inventory risk and indirectly the 
PNL distribution. 
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2 Stochastic control framework 

Setting of the problem 

We suppose that the mid-price process S{t) follows an Ito diffusion, i.e. 

dS{t) = b{t, S{t))dt + cr{t, S{t))dW{t) (f ) 

where W{t) is a standard Brownian motion in a filtered probability space (f2, J^, F — {J^t)o<t<T,P)- 

A market-maker can control her ask and bid quotes, which we denote p'^it) and p~{t) respectively. 
Instead of working with the market-maker's prices we will rather work with the market-maker's spreads, 
i.e 

5+{t) p+{t) ~ S{t), 6-it) := S{t) - p-{t) . 

The market-maker's spreads are assumed as two predictable processes. Under these variables, the market- 
maker's bid-ask spread is thus 6^ + 6^ > (see Figure 1). 
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Figure 1. Description of the intensity of limit orders (LOs) as a function of the distances ((5+, 5^). 
As usual, 5^ 1—^ A* ((5*) is decreasing: the closer we are to the mid-price, the more likely our LOs are 
executed. Here we explicited the exponential decay we will use in our model. 



Strictly speaking, we should consider that (5+ > and > 0. However, these constraints render the 
optimisation problem very hard to solve explicitly due to boundary effects at = 0. In consequence, and 
since our goal is to have explicitly the values of the optimal distances {S^ , ^ ~ ) of the optimal market-maker 
quotes *), we will assume in our analysis that 6^ G M. However, in our numerical simulations we 

interpret (5* < as a market order (see Figure 2). 
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Figure 2. Extrapolation of the intensities A* of limit orders (LOs) when 5^ < (dotted lines). For our 
simulations, when (5* < we will assume they are market orders. 

In addition to the mid-price process S(t) and the predictable processes (J+jJ"), we will consider two 
other processes. On the one hand, the inventory Q(t) G Z, which varies with the execution of the limit 
orders placed by the market-maker; on the other hand, the cash process X{i) e M, which also varies as 
the market-maker buys or sells the asset. 



We will assume that the dynamics of Q(t) and X(t) are governed by 
dQ{t) = dN-{t) - dN+{t) , 

dX{t) = [S{t)+5+]dN+{t)-[S{t)-5-]dN-{t), 

where N^{t) are two independent Poisson processes of intensity A* ((5*) and 5^ H> A* ((5*) is decreasing. 
In that framework, the PNL or wealth of the market-maker is 

PNL(i) = X{t) + Q{t)S{t) e M . 

Finally, we will also assume that the market-maker has a utility function 0(s, g, x) and an associated 
value function 

u(t,s,g,x):= niax ¥.t,s,q.xU(s{T),Q(T),X{T)) 

where t e [0, T] is the current time, s — S{t) is the current mid-price of the asset, x = X{t) is the current 
cash and q — Q{t) is the current inventory level. 



It is worth to mention that all these assumptions are also present (explicitly or implicitly) in both 
Avellaneda and Stoikov Ij and Lehalle et al [6]. 



The Hamilton-Jacobi-Bellman equation 

Let us give a heuristic interpretation of the Hamilton-Jacobi-Bellman equation in terms of the two in- 
finitesimal operators, one on the continuous variable s and the other on the jump variables {q,x), as well 
as of the stochastic controls {S^,S^). 



6 



Pietro FODRA ■ Mauricio LABADIE 



First, suppose q and x fixed. Since the continuous variable s follows Q, from Fcynmann-Kac repre- 
sentation formula we have that if (t, s) i— >• uc{t, s, q, x) satisfies 

{dt + C)uc^O, C:=b{t,s)ds + ^<j\t,s)dss, uc{T,s,q,x)^(f>{s,q,x) (2) 
then (see e.g. Pham [9 ) 



ucit,s,q,x) =Et,s (f>(^S{T),q,x) 



Second, suppose now that s is fixed. We model the arrival of orders to the limit-order Book as two 
independent Poisson Process, iV+ for the ask quotes and N~ for the bid quotes , with intensity A* 
(respectively). 

• For the ask quote, we assume that A+ depends on the distance to the mid- price s, i.e. A+ = A+((5+). 
Moreover, since buying market orders favour ask quotes with the smallest spread (5^, it is natural 
to assume that i— >■ A+((S^) is increasing, i.e. the probability of execution for the market-maker 
decreases as she moves her ask quote further away from the mid price. 

• Analogously, for the bid quote we have A^ = X^{5^) and i— > X^[S^) increasing. 

The jump variables (g, x) are related via the arrival of market orders that hit the quotes of the 
market-maker: 

• Suppose a buying market order of one share of S hits the ask quote of the market-maker. It follows 
then that x i-^ x + {s + 5'^) and q g — 1. 

• Analogously, if a selling market order of one share hits her bid quote then x t-^ x + {s — S^) and 
q^ q + l. 

Under this framework, we have that the value function of the jump (g, a;) i— >• uo{t, s, q, x) satisfies 

dtUD + A+(5+) [uD{t, s,q-'l,x + {s + 5+)) - uoit, s, q, x)] (3) 
+X~{S~) [uoit, s,q + l,x - {s - S^)) - Unit, s,q,x)] = 0. 

u{T,s,q,x) = (p{s,q,x). 

Third, let us now put together the continuous and jump dynamics ([2])-(|3]). Suppose that the market- 
maker's spread quotes {S~^,S~) are known (i.e. deterministic), the value function u{t,s,q,x) satisfies has 
the following infinitesimal generator (see e.g. Ho and StoU |S]): 

(dt + C)u + X+{S+) [u{t, s,q-l,x + {s + S+)) - u{t, s, q, x)] (4) 
+X^ (S^) [u{t, s, q + 1, X — {s — 5^)) — u(t, s,q,x)'^ = 0. 

u(T,s,q,x) — (j){s,q,x) . 

Fourth, notice that Q is valid only when the spread quotes {5^ , 5~ ) are known, but in the current 
case they are part of the set of unknowns of the problem. In consequence, from the stochastic control 
theory it follows that the value function u(t, s, q, x) with unknown controls ((5+,(5^) is solution of the 
Hamilton- Jacobi-Bellman equation 

{dt + £) u + maxA+(5+) [u{t,s,q- l,x + {s + S+)) -u{t,s,q,x)] (5) 

(5+ 

+ max A^((5~) 5, g + 1, x — (s — 6~)) — u(t, q. a:)] = , 

u{T,s,q,x) = (j){s,q,x). 
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In general we should use sup instead of max in ([5]), but if we assume that b{t, s) and a{t, s) are Lipschitz 
and that the jump dynamic ([s]) is bounded then the supremum is attained and the solution u(t, s, q, x) is 
unique, as in the particular cases we will consider here. 

Fifth, Avellaneda and Stoikov [T] showed that, given the empirical evidence provided by the current 
research in Econophysics e.g. Potters and Bouchaud [T^, we can assume that X^{S) = Ae-^^. Under 
this framework, the Hamilton- Jacobi-Bellman ([s]) becomes 

{dt + £) u + maxAe"'"'^ [u{t, s, g - 1, a; + (s + 5+)) - u{t, s, g, x)] (6) 

-\-ma.xAe~^^ [u{t, s,q + l,x — {s — S~)) — u{t, s,q,x)] — 0, 
5~ 

u{T,s,q,x) = (t){s,q,x), 



Solving the Hamilton-Jacobi-Bellman equation 

Equation ^ is the one we will consider in the rest of the present work. The steps to solve it are as 
follows: 

1. Based on the utility function (f>{s,q,x) we make an ansatz, i.e. we guess the general form of the 
solution of the on Hamilton-Jacobi-Bellman equation. For example, if 

(/)(s,g,a;) ^x + ip{s,q) 

we will use 

u{t,s,q,x) = X + v{t,s,q) . (7) 

2. We substitute the ansatz on the HJB in order to find an easier HJB equation for v. We use this 
new HJB equation to find the optimal controls {6^ , S^) that maximize the jump. With the ansatz 
([t]) the optimal controls are 

5+ = y - s + v{t, s, q) - v{t, s,q~l) , (5^ = 7 + s - v{t, s,q + l)+ v{t, s, q) . 
k k 

3. We substitute the optimal controls on the HJB equation; the resulting equation is called the 
verification equation. In our case it is 

v{T,s,q) = 'fi{s,q), 

which is highly nonlinear because = S^{v). 

4. We solve the verification equation via the Feynmann-Kac representation formula in order to find 
an explicit expression of the optimal controls. Indeed, for an equation of the form 

{dt + c)w + fidt,s:;,t,s,q,x) = o, 

w\t=T = g{6t,S~,s,q,x), 



8 



Pietro FODRA ■ Mauricio LABADIE 



where {5^,5^ ) do not depend on w, the (unique) solution is (see e.g. Pham [5]) 



w{t,S,q,x) =Et,s,q,x 



f{St,S* , £.,S{£,),q, x)d^ + g{dt,S^ , S{T), q, x) 



where Et.s,q,x is the conditional expectation given S(t) = s, Q{t) = q and X(t) — x 
5. Alternatively, we could express the optimal quotes in terms of the market-maker's bid-ask spread 

V'* ■■=5t + <57 

and the mid-point of the spread (called the indifference price) 

r* \ {pt +P*)=s+^ {St - S-) . 

Notice that if (5+ = then r^{t) = s = S{t). Therefore, r^ — s measures the level of asymmetry of 
the quotes with respect to the mid-price s. 

3 Linear utility function 

Let us suppose that the utility function is linear, i.e. 

(j){s, q,x) = X + qs . 

Then the corresponding value function is 

uit,s,q,x)^ max Et,s,qA^{T) + Q{T)S{T)], (8) 

(S+.S-) 

where X{t) is the cash process, Q{t) the inventory process, S{t) the price process and Et.s.g.x is the con- 
ditional expectation given S{t) = s, Q{t) = q and X{t) = x. Alternatively, this corresponds to choosing 
the final condition as (j){s,q,x) = x + qs. 

X{T) + Q{T)S{T) is the final value of the market-maker's portfolio, and corresponds to the final PNL 
of the market-maker. Indeed, X{T) the cash she holds whilst Q{T)S{T) the cash value of her inventory: 
she holds Q{T) assets and clears them at (unitary) price S{T) on the close auction. 

Ansatz 

From the final condition (i.e. the utility function) (j}{s, q,x) = x + qs we will search a solution of the form 

u{t, s,q,x) = X + 9o{t, s) + q9i{t, s) . (9) 

Plugging ^ into ^ yields 

{dt + C) {00 +qOi)+ m&xAe-''^^ [s + 6+ - 0i] + maxAe-''^' [-s + 6' + Oi] = , 

(5+ S 

Oo{T,s) - 0, 
Oi{T,s) - s. 
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Computing the optimal controls 

Define 

f+{S+) ■.^Ae'''^* [s + 5+ -Oi] 
Using Calculus we obtain that the maximum is attained at 



C = T - s + ^1 
k 



Analogously, if 
then 



r(r) :=Ae-^-^" [-S + 5- +i 



In consequence, the optimal quotes ), spread and indifference price r^, are 



Solving the verification equation 

Since 

it follows that the ansatz 9q + q9i solves the verification equation 



71 . 



OA 

{dt + c){eo + qei) + — cosh[k{ei-s)] = o 

efc 

OoiT,s) = 0. 
Oi{T,s) = s. 



(10) 



We separate ( 10 ) in terms of the powers of q, one equation for g*^ = 1 and another for q^ = q. With 

(11) 



this procedure we obtain two coupled equations, 



2A 



and 



( df + C) do + -r cosh \k{9 1 -s)] = 
ek 

eo{T,s) = 0, 



idt+C)0i = 
9i{T,s) = s. 



(12) 



Applying the Feynman-Kac formula to ( 12 1 and recursively to (111 yields, respectively (see e.g. Pham 



i), 



2A 

hit,s)=EtJS{T)], eoit,s) = —Et^, 

Grv 



cosh 
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In consequence, 



and 



u{t,s,q,x) = x+ -^^t,s 



1^ cosh [k(0i{^, SiO) - S{0)]d^^ + qKtASin 



In particular, since cosh(a) > 1 and cosh(a) = 1 ■^=^ a = Q we obtain that 

2A 

u{t,s,q,x)>u{t,s,q,x) := x + —{T - t) + qEtAS{T)] 

and u{t,s,q,x) = u{t,s,q,x) ■^=^ 0i{S,,S{^)) = S{S,) for all ^ G [t,T]. Since i G [0,r] is arbitrary then 
taking ^ = t we have that u{t,s,q,x) = u{t,s,q,x) s = E(_s[S'(T)], i.e. if and only if S{t) is a 

martingale. 



Results 

Let us summarise all our findings. 

Theorem 1 Consider the Hamilton-Jacobi-Bellman problem 

{dt + C)u + max^e"*'''^ [u{t, s,q-l,x + {s + ^+)) - u{t, s, q, x)] 

+ maxAe~'^* [u{t,s,q + l,x — {s — 6~)) — u{t,s,q,x)\ = 0, 

u{T, s,q,x) = x + qs , 

which corresponds to a linear utility function (l){s, q,x) = x + qs, value function 

u{t,s,q,x)= max Et,,,g,^[X(T) + Q(T)5(T)] 

{S+,S ) 

and stochastic controls {5~^,6~). Then: 
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1. The optimal controls (6^,6^ ), spread "0* and indifference price (i.e. the centre of the spread) of 
the market-maker are 



5± = - ± (Et,, [S{T)] -s) 0, = 5t + K 



r,=¥.tAS{T)]. 



2. The (unique) solution of the HJB problem is 



2A^ 

u[t,s,q,x) = x-\ -Et 

ek 



cosh 



;(0i(C,5(C))-5(^))]de[+gEt,,[5(T)] 



3. The solution u{t,s,q,x) is bounded from below by 



2A 

u{t, s, g, x) := X + — (T - i) + gEt,,[5(T)] 



and 



u{t,s,q,x) = u{t,s,q,x) s = Et_5[S'(r)] , 

i.e. if and only if S{t) is a martingale. In this case we have 

1 



r,it)^S{t), St 



and s, q, x) — x -\- qs . 



Remarks 

• The worst price dynamic for the PNL-based utihty function ([S]) is a martingale, in the sense that 
with any other price dynamic the PNL is greater. But observe that the optimal spread "0* is centred 
around = Et^s[S'(T)] and not around s = S{t). In consequence, if the market-maker considers that 
the current mid-price s has deviated from its fundamental value then she can make directional 
bets via her bid-ask quotes, which yields a higher PNL than the martingale assumption if the bet 
is correct. 

• Applying perturbation methods on the variable q of the form 

u{t, s,q,x) — X -\- 9o{t, s) -f q9i{t, s) -\- q^92{t, s) + ■ ■ ■ , 

is a very rough approximation, to say the least. Indeed, as Lehalle et al [6] pointed out, q is an 
integer, i.e. discrete and not small, and as such a perturbation method on q cannot be performed. 
However, once the ansatz is shown to solve the verification equation, then by uniqueness it coincides 
with the solution of the original problem. Therefore, the separation of the equation into two terms, 
one with q^ and another with g^, is justified a posteriori via the maximum principle (i.e. existence 
and uniqueness) for the Hamilton- Jacobi-Bellman equation, and as such it does not rely at all on 
perturbation methods, as Avellaneda and Stoikov [1] suggested. 
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• In Theorem [I] we have imphcitly assumed that the value function u(t,s,q,x) is finite when we 
apphed the Fenmann-Kac formula. However, this is valid if and only if 



cosh 



< oo , 



0i{t,s)^EtJS{T)]. 



(13) 



If S{t) is a martingale then (13 1 holds trivially. For a non-martingale mid-price process S{t), two 



sufficient conditions for (13) to hold are (i) the conditional expectation Et_s['S'(T)] is affine on s 



and (J,i) the moment-generating function Mz{X) = E [exp{AS'(t)}] is finite for all A € M. This is 
the case for any Gaussian Markov process, e.g. an arithmetic Brownian motion with drift and the 



Ornstein-Uhlcnbeck process. However, ( 13 ) does not hold for the geometric Brownian motion with 
drift. 



4 Linear utility function with inventory penalty 

With the linear utility function there is no penalty if at the end of the trading day the market-maker 
carries a huge inventory. In order to force a liquidation of the inventory before the end of the day, we 
propose the following utility function, 

(l){s,q,x) = X + qs -r]q^ , > , 

which is the PNL with a quadratic penalty on the inventory. The associated value function is 



uit,s,q,x)= max Et,s,gAX{T) + Q{T)S{T) - rjQ^{T)] , 

{5+,S ) 



(14) 



A quadratic penalty function for a market-maker is already known in the literature (see e.g. Stoll 



Ansatz 

Given the form of the utility function, we will search a solution of the form 

u{t, s,q,x) = X + do{t, s) + q9i{t, s) - riq^02{t, s) . 
Plugging (15) into ^ yields 

{dt + C) (Oa + qOi - m^e^) + max Ae-'^^^ [s + 5+ -9^- 7^(1 - 2g)02] 

(5+ 

+ max Ae^'^''" [s + 5" + 6*1 - rj{l + 2q)e2] 

0o{T,s) 
OiiT,s) 
02{T,s) 



= 0, 
= 0, 



s . 
1. 



(15) 
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Computing the optimal controls 

As in the previous section, if 



then 

On the other hand, if 
then 



St + + 2q)02 . 



nS-) Ae-^*" [s + 6- +61- t]{1 + 2g)02] 



In consequence, the optimal quotes {6^,^^ ), spread and indifference price r^, are 

Sf ^l+vd2±{9i-s-2q7j92) , A = St + 5- = ^+27^02, = 01 - 2T^q92 . 

Solving the equation with linear jumps 

We fix g e Z and define the jump functional 

A 



MSt,S:):=^{e'-''' +e'-'''^) 



whose first-order Taylor expansion (i.e. its Frechet derivative) is 

jM^s:) = 4(4-fc(C + 0)+o(|i-fcei' + |i-fcCp) 

= 4 (2 - kvd2) + 0{\1- M+|2 + |1 - . 
Since at first order the jumps are independent of q, it follows that 

e{t, s, q) = 9oit) + q9i{t, s) - m^92{t) 

solves 

idt+C)e+^{2-kij02) = 
efc 

0(T, s, q) — qs — rjq^ . 



We separate ( 16 ) in terms of the powers of q, one equation for q^ = 1 and another for q^ ~ q. With 
this procedure we obtain three coupled equations, 

9*00 + 4(2-^2) = (16) 
efc 

^o(T) = 0, 
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{dt + £)ei = (17) 

Oi{T,s) = s, 

and 

dte2 = (18) 
^2(T) = 1. 

In consequence, 

$2 = 1, e,{t,s)=Et,s[S{T)], eo{t) = ^{2-kr,){T-t). 

Finding a sub-solution 

Since 

then the solution of the equation with linear jump (i.e. first-order Taylor) is a sub-solution of the original 
problem with exponential jump. In other words, if we define 

u{t, s,q,x) :=x + 6o{t) + qOi {s,t) - r]q^^2 {t) , 

where 6o{t), 6i{s,t) and 02{t) are defined as above then 

u{t, s, q, x) < u{t, s, q, x) , 

i.e. it is a sub-solution of the HJB equation. 

Results 

Let us summarise all our findings. 

Theorem 2 Consider the Hamilton-Jacobi-Bellman problem 

dtu + Cu -h max Ae~^^* [u(f, s^q— 1, x -|- (s -|- 15"'")) — u{t, s, q, x)] 

(5+ 

-t-maxAe"*^^ [u{t, s, q + 1, x — {s — 6^)) — u{t, s, q, x)] — 0, 

u{T, s,q,x) = x + qs - r]q^ , 

which corresponds to a linear utility function with quadratic inventory penalty <p{s, q,x) = x + qs — rjq^, 
value function 

u(t, s, q, x) = niax E*,,,,,^ [X{T) + Q{T)S{T) - vQ^{T)] 

{S+,S ) 

and stochastic controls {5~^,5~). Then: 
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1. Let u{t,s,q,x) be the (unique) solution of the HJB equation. Then 

u{t, s,q,x) := X + — {2 - kr]) {T - t) + gE*,, [S{T)] - 

ek 

is a sub-solution of the HJB equation and u(t, s, q, x) > u(t, s, q, x). 

2. With the linear aproximation of the jumps, or equivalently using the HJB of the sub-solution u, the 
optimal controls {5^,5~), spread and indifference price r* are 

5t = \+r,± (Et,, [S{T)\ - s - 2qr]) , V* = + -5" = ^ + 2r? , = E^,, [S{T)] - 2m . 



General inventory penalties 

Suppose that the utihty function is now 

4i{s,q,x) = X + qs - riq^'K{s) , V > , 

where s 7r(s) is continuous and for s > it is non-decreasing and non-negative. For example, if tt = 1 
we recover the previous case whilst if 7r(s) = we recover the classical mean-variance PNL criterion. 
The associated value function is 



u{t,s,q,x) = max Et,,,,,,[X(T) + Q{T)S{T) - rjQ\T)Tr{S{T))] . 

(S+,S ) 

With the ansatz 

u{t, s,q,x) = X -\- 6o{t, s) -\- q9i{t, s) - r]q'^62{t, s) 
it can be shown that the functions 9o — O2 solve the equations 



{dt + jC)9o + -^{2- krjOi) 



and 



whose explicit solutions are 



{dt + C) 9, 
9i{T,s) 

{dt+C)92 



9o{T) = 

= 
= s, 

-- 

= 7r(s), 



(19) 
(20) 
(21) 

(22) 
(23) 



2 A nA 

92{t,s)=Et^MS{m, ^i(i,s)=Et,,[5(T)], 9o{t,s) = —{T-t)-^Et,, 



«({))<« 



In the light of these results, we have the following extension of Theorem 2 to a general penalty function 

TT. 
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Theorem 3 Consider the Hamilton-Jacobi-Bellman problem 

dtu + Cu + niaxAe^'^''^ \u{t, s, q — 1, x + (s + S^)) — u(t, s, q, x)] 



s+ 

+ inax^e~'''^ [u(t, s, q + I, x — {s — 6^)) ~ u{t, s, q, x)] — 0, 

u{T,s,q,x) = X + qs — rjq'^TT{s) , 

which corresponds to a linear utility function with quadratic inventory penalty (j){s, q, x) = x+qs — r]q'^7r{s), 
value function 

uit, s, q, x) = max Et,s,g,. [X{T) + Q{T)S{T) - 7jQ^T)7r{S{T))] 

{5+,S ) 

and stochastic controls (6^,6^). Then: 

1. If u{t, s, q,x) is the (unique) solution of the HJB equation then 

+ qEtAS{T)]-7^q^e2{t,s), 



u{t, s, q, x) := X + '^{T - t) - —Et.s 
ek e 



where 92it,s) = ,5[7r(S'(r))], is a sub-solution of the HJB equation and u{t,s,q^x) < u(t,s,q,x). 

2. With the linear aproximation of the jumps, or equivalently using the HJB of the sub-solution u, the 
optimal controls {5^,5^), spread ip^, and indifference price r* are 

5t = l+v± i^^tASiT)] - s - 2qTjEtA^{S{Tm , V* = C + S: = ^+2r^, 
r, =EtAS{T)] - 2r,qEtAAS{T))] . 



Remarks 

• The optimal controls depend on the ansatz we make on the utility function u(t, s,q,x), i.e. on 
the functions Oi{t, s) and 92(t). However, without the hnear aproximation of the jumps our ansatz 
cannot give the solution: in fact, there is no solution with the chosen ansatz. That said, given that 
the sub-solution u is exphcit, and by definition it provides a lower bound on the real solution u, we 
can consider that the quotes we have found are optimal for the sub-solution, which is a lower bound 
on the (penalised) PNL. Under that spirit, the optimal quotes can be interpreted as conservative 
estimates of the real optimal quotes, since they minimise the potential drops on the PNL. 

• If ry = we recover the optimal controls and the sub-solution of the linear case without inventory 
penalty, i.e. Theorem [T] Therefore, our linear approximation of the jumps is consistent, in the 
sense that it provides a perturbation of the optimal quotes in terms of the "inventory-risk" or 
"risk- aversion" parameter rj. 

• When rj > the spread "0* widens and the indifference price shifts downwards (resp. upwards) 
if the inventory is positive (resp. negative), which is in line with the intuition on the inventory 
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risk. Indeed, if the net position of the market-maker is long (resp. short) then she will improve the 
current ask (resp. bid) quote to lure buyers (resp. sellers), and simultaneously she will try to hide 
her bid (resp. ask) quote deep into the limit-order Book to deter sellers (resp. buyers). By doing 
so, she favours the probability of being executed in the direction that makes her to go back to zero. 

• Notice that since the linear approximation of the jump functional (i.e. its Frechet derivative) 
does not depend on q, the solution u of the approximate verification equation does not rely on 
perturbation methods and asymptotic expansions on q. Of course, the solution u to the verification 
equation with exponential jumps will indeed depend on q, and as such an approach similar to 
Lehalle et al |6] is needed in order to deal with the discrete variable q. 

• For Theorems [2] and [3] we are assuming that the corresponding value function u{t, s, q, x) is finite in 
order to apply the Feynmann-Kac formula. In the current framework, given a mid-price dynamic 
we choose the penalty function n such that u{t, s, q, x) is finite. In that spirit, if S(t) is Gaussiann 
(e.g. arithmetic Brownian motion or Ornstein-Ulenbeck) or a martingale (even with jumps) then 
TT = 1 suffices, whilst if S{t) is a geometric Brownian motion then 7r(s) = is a viable candidate. 

• As we mentioned above, it is easy to find sufficient conditions to ensure boundedness of the value 
function u,, for 77 > 0. If uo is the explicit solution without inventory constraints of theorem [l] then 
the maximum principle shows that < < uq. Therefore, if uq is finite then u,, is bounded. 
This is true for processes S{t) such that Ej s[S'(T)] is affine in s, i.e. 

Et,s [S{T)] = a(i) + /3(t)s , a, /3 e C° [0, T] . 

In particular, Ujj is bounded if S(t) is an arithmetic Brownian motion or an Ornstein-Uhlenbeck 
process. 



5 Exponential utility function 

This case has been entirely solved by Avellaneda and Stoikov IJ and Lehalle et al [6] when the mid-price 
is a Brownian motion. In this section we show that their approach can be easily extended to several other 
mid-price dynamics, e.g. Ornstein-Uhlenbeck. 



Let us suppose that the utility function is exponential (f)(s,q,x) — — exp{— 7(2; + qs)}, whose core- 
sponding value function is 



u{t,s,q,x)= max E* „ „ „ 



exp 



[-^(X{T) + Q{T)S{T))] 



(24) 



Ansatz 



From the form of the utility function we will search a solution of the form 

u(i, s, q,x)^~ exp{-7(a; + 6{t, s, q)} , 0{t, s, q) = 6'o(t) + 96*1 (t, s) + q'^62{t) . 



(25) 
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Plugging (25) into ([6| yields the Hamilton- Jacobi-Bellman for 9{t,s,q), i.e. 

[dt +C)e-^a^^ ids0f + ^ max er''^^ [l - exp{-7(s + <5+ - + (1 - 2g)02)}] 

A 

+ -majiAe-''^ [l - exp{-7(-s + (5" + + (1 + 29)612)}] ] = 0, (26) 
75-'- ■' 

9{T, s) = X + qs , 

Computing the optimal controls 

For the function 

/+(<5+) ^e-^*^ [1 - exp{-7(s + 6+ - 0^ + {1 - 2g)02)}] 



its maximum is attained at 

St 



1 

Analogously, if 



-log(l + ^) -s + (?i-(l-2g)02. 



7 6 

then 



r{5-) - max^le-^-* [l - exp{-7(-.s + 5" + + (1 + 217)02)}] 



5- = - log (l + l)+s-ei-{l + 2(7)02 . 



7 V k/ 

In consequence, the optimal quotes {^^,5^), spread and indifference price are 



St = 
7 



^log(l + ^) -02±(^i-s + 2g02), = ^ log (1 + ^) - 202 , =01 + 2(702. (27) 
Solving the equation with linear jumps 



For q £ Z fixed we define the jump functional as : M — > M as 



MSt^S-) := ^ (e-^^." + e-^^.") . (28) 



The first-order Taylor expansion of Jq (i.e. its Frechet derivative) is 

MSt,S-) = j^{2-k{St + S-))+0{\l-k6t\' + \l-k5-\') 



2^ / k^ 

1 log 



fc + 7 \ 7 



(1 + ^) +^02^ +0{\l-k6t\^ + \l-kS-\^) 



Therefore, at first order we have that Jq — Jq for all q G Z, i.e. the jumps are independent of q. In 
consequence, 

0(i, S, q) = 0o(<) + q9i{t, s) + q^92{t) 
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solves 



1 9 A / k 

2 fc + 7 \ 7 



1- -log(l + ^) +fc02j = 

0{T,s,q) = qs. 



this procedure we obtain three coupled equations, 



We separate ( 29 ) in terms of the powers of q, one equation for = 1 and another for q^ = q. With 

(29) 



/ k 

dtOo + T— 1 - - 
/e + 7 V 7 



log (l + ^) + fc02 j = 

Oo[T) = 0, 



{dt + c)e^ = 



(30) 



and 



9*02-7^^27(9,01)' = 



(31) 



92{T) = 0. 
Applying the Feynnian-Kac formula to ( |30[ ) we find 

0i(t,s)=Ei,J5(r)]. 



Integrating (31) we obtain 



it) - ^7 f ^\s,.s{0)(dse,{i,s{s,))fdi. 



However, the ansatz we have made implies that O2 is independent of s. Therefore, in order to solve (31) 
we need to assume the following conditions on the price process S{t): 



In consequence. 



a = a[t) , Et,s [S[T)] = a{t, T) + s(3{t,T) . 

02{t) = -\i j\\OP\£..T)d^- 



Finally, integrating (30) yields 



. / N / k 

%(t) = r— 1 " - 

fc + 7 V 7 
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Finding a sub-solution 

Since 

A 



fc + 7 



> 



A 



{2-k{s:+s-)) 



then the solution of the equation with hnear jump (i.e. first-order Taylor expansion)is a sub-solution of 
the original problem with exponential jump. In other words, if we define 

u{t,s,q,x) ■.^-exp{--f{x + 0o{t) + q0i{s,t) + q^d2{t))} , 

where do{t), di{s,t) and 6*2 (t) are defined as above then 

u{t, s, q, x) < u{t, s, q, x) . 

Adding a quadratic inventory penalty 

We modify the exponential utility function (|24|) by adding a quadratic inventory penalty: 



u{t,s,q,x) = max E^.^^,^^ 

(S+J ) 



exp 



{~l{x{T) + QiT)S{T) - riQ\T)) } 



(32) 



Under this new penalty framework, the computations are exactly the same as before. The only thing 
that changes is the equation solved by 02(i), i-e. 







(33) 



i(T) = -77, 



whose solution is 



1 



2it) = -V-^ll a'{OP'{^,T)d^. 



This has an impact on 9o{t), which has the new form 
2A 



k + 



1 log 

7 



k-^A 
k + l Jt 



as well as on the optimal quotes, spread and indifference price (27). 
Results 

Let us summarise all our findings. 

Theorem 4 Consider the Hamilton- J acohi- Bellman problem 

dtu + Cu max Ae^'^'^ [u(i, s, g — 1, a; + (s + 5'^)) — u(i, s, q, a;)] 



5+ 

-\- max Ae 

s- 



[u{t,s,q+l,x-{s~S )) ~ u{t,s,q,x)] = 0, 

u{T, s, q,x) = - exp{-7(a; -I- qs - -qq^)} , 
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which corresponds to an exponential utility function (j){s, q,x) — — exp{— 7(0; + qs — r/q"^)}, value function 
u{t,s,q,x)^ max Et,,^g^J- exp \--f(x{T) + Q{T)S{T) - tjQ\T))}] , 

((5+,(5~) L L \ / J . 

and stochastic controls {S~^,S^). Assume further that the mid-price process 

dS{t) = b{t, S{t))dt + a{t)dW{t) 

satisfies 

Ei,, [SiT)] = ait, T) + s/3(t, T) , V(t, s) . 

Then: 

1. With the linear approximation of the jumps, or equivalently using the HJB of the sub-solution u, 
the optimal controls {5~^ ,6~), spread and indifference price are 



where 



2A 



A: + 7 

e^{t,s) = EtAS{T)], 

2. Let u{t, s,q,x) he the (unique) solution of the HJB equation. Then 

u{t, s, q, x) := - exp {-7(2; + i{t, s, q)) } , e{t, s, q) = 9o{t) + qOiit, s) + q^02{t) 
is a sub-solution of the HJB equation and u(t, s, q, x) > u(t, s, q, x). 



Remarks 

• The linear approximation of the jumps turns out to be independent of q, although for higher orders 
this is no longer true. This means that we are not performing perturbation methods and asymptotic 
expansions on the (discrete) variable q when we solved the approximate verification equation. Of 
course, if we want to solve the real HJB problem then we need to take into account the discrete 
variable q, which leads to an infinite system of equations as in Lehalle et al [6]. 

• In the case of 77 = the inventory penalty tends to zero as i — > T. This implies that the penalisation 
is not stong enough to force the market-maker to finish her day with a flat inventory, as it will be 
shown in the numerical simulations. Therefore, it was necesary to add an " inventory- risk" parameter 
77 > in order to ensure a flat inventory at the end of the day. 
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Examples 

• If S{t) is an arithmetic Brownian motion with drift, i.e. 

dS(t) = bdt + (7dW{t) 

then 

Et.s[SiT)] = .s + b{T-t), 

which imphes that 

01= .s + b{T -t), 02 = -V- \l^''iT - t) . 
In consequence, the optimal controls are 

5t = Uog(l+"^+f^+]pa\T-t)±(b{T-t)-q[2i^ + ^a\T-t)]), 
2 



V'* = -log(l+ I) +277 + 7a2(T-t) 
= s + &(T-t) -9(277 + 7(j2(T-<) 
In particular, if 6 = 77 = we recover the results of Avellaneda and Stoikov IJ. 

If S{t) is an Ornstein-Uhlenbeck process, i.e. 

dS{t) = a{n - S{t))dt + <jdW{t) 

then 

which implies that 

0, = se-"(^-*) + /i (1 - e-''^^-*)) , 02 = -r? - ^ (1 - e-'<^-'^) . 
In consequence, the optimal controls are 

St = -logfl + ?U77+^fl-e--(--: 



7 V /c/ 4a 

± 



((A^ - .) (1 - e-"(^-*)) - , [27; + ^ (1 - e-^<^-*)) 



g-2a(T-t) 
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Solving the nonlinear equation: Lehalle's approach 

Assuming that S{t) is a Brownian motion, Lehalle et al j6] found that the solution it(i, s, g, x) of the 
nonUnear problem can be explicitly found, but that study can be easily extended to arithmetic Brownian 
motions with time-dependent volatility and drift. 

Suppose that the mid-price process is of the form 

dS{t) = b{t)dt + a{t)dW{t) , 

where h{t) and a{t) are uniformly bounded in [0,r]. Let us make the ansatz 

u{t,s,q,x) = -eyi.Y>{-l{x + qs)}vq{t)-''''' ] € C\0,T) , geZ. (34) 

Following Lehalle et al ^ it can be shown that the optimal quotes are 




where {vq{t))q^z solves the (infinite) ODE system 

v'qit) = l^'^a^t) - ^qb{t)^ vq{t) -(^a(^1+'i^ ' j {^vq+,{t) + Vq^.H)) , (35) 
Vq{T) = 1. 



Lehalle et al [5] used a constructive proof to show that (351 has a unique, strictly positive solution in 
C°°(^[0,T));£'^{'Z)j. However, this result can be easily proven in a non-constructive fashion. 



Let E be a Banach space and consider a system on E of the form 

V'jt) = Fg{t,Vit)), 



(36) 



Vq,T > , 



where F{t,v) = {Fq{t,v))qez, ■ [0, T] x E — E is Lipshitz in v uniformly in t £ [0,T]. Applying the 
Cauchy-Picard Theorem and the maximum principle for ODEs on the Banach space ^[0, T); E^ yield 
existence, uniqueness and positivity of the solution u of (35 1 (see e.g. Brezis [2]). Unfortunately, since the 



linear function Fq(t,v) corresponding to (35) is proportional to q , we cannot apply the Cauchy-Picard 
Theorem directly. However, if we define 



E = £°°{Z) , 



w e E : sup q Vq < +oo 



(37) 



then for t fixed we have that F{t, •) : F — >■ E is linear and bounded, and the bound is uniform in t. 
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Therefore, F{t,v) is Lipschitz in v, uniformly in t. In consequence, we can now apply the Cauchy-Picard 
Theorem to ensure that there exists a unique positive solution of (35). 



Remarks on Lehalle's approach 

• If instead of the Banach spaces in ([37| we use the Hilbert spaces 




(38) 



we recover the framework of Lehalle et al [6 . 

• Our proof is non- constructive, which implies that we cannot provide explicit asymptotic estimates 
of the solution v — {vq)q^z. Lehalle et al [6], on the contrary, constructed the operator explicitly, 
and thus they were able to show the asymptotic behaviour of v based on the spectrum of the linear 
operator. 



From the ansatz (34) we see that the coefficients in the ODE system (35 1 cannot depend on s. This 



rules out mid-price processes whose drift and volatility depend on s, e.g. Ornstein-Uhlenbeck and 
geometric Brownian Motion. However, arithmetic Brownian motions with time-dependent drift and 
volatility can be used. 



6 Numerical Simulations and sample paths 

We performed several simulations of the optimal market-making strategy, i.e. the spread V'* a-nd the 
indifference price r*, under an Ornstein-Uhlenbeck mid-price process, i.e. a mean-reverting price dynamic 
of the form 

dS{t) = a(/i - S{t))dt + adW{t) . 

We considered four strategies: linear/exponential utility and mean-reverting /martingale market-making 
assumption. This allows us to assess the effect of the directional bet /i on the PNL of the market-making 
strategy. The parameters we used are k — 100, A ~ 1500, T = 1 (which corresponds to one trading 
day), n = 1000 (which corresponds to 1000 bid/ask limit orders sent per day, approx. once every 30 
seconds), 5(0) — 1, a = 0.05 (daily volatility of 5%), a = 1 and 77 = 0.0001 (which is a very small 
inventory- risk aversion but enough to force the inventory to end the day flat in average). Under these pa- 
rameters we have a constant linear spread of ip^ — 0.0202 whilst the exponential spread is time-dependent. 

We chose three different values of /i: 0.98, 1.00 and 1.02. This corresponds, respectively, to a bet 
that the price will go down by 2%, will oscillate around its open price or go up by 2%. We used the 
linear utility function with inventory penalty and two strategies, a mean-reverting strategy (Ornstein- 
Uhlenbeck) with the correct directional bet, and the martingale strategy (arithmetic Brownian motion) 
with no directional bet. The martingale strategy performs a pure market-making strategy under inventory 
constraints. On the other hand, the mean-reverting strategy performs the same market-market strategy 
than the martingale but it also places directional bets, which can be seen not only in the agressiveness 
of the ask and bid quotes but also in the fact that the algorithm can place market orders. 
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Figure 3. Simulation of the market-making strategy under a mean-reverting mid-price dynamic with 
(asymptotic) mean /i — 0.98. Upper-Left: mid-price (black), optimal ask quote for the market-maker 
(dark blue), optimal bid quote (light green), fi (light blue). Lower-Left: Inventory for the 
mean-reverting process (black) vs the inventory for the martingale (blue). Upper- Right: cash. 
Lower-Right: PNL of the mean reverting process (black) compared with the benchmark, i.e. the PNL 

of the martingale strategy (blue). 



In Figure 3 we have plotted a realisation of the market-making strategy for /i = 0.98, i.e. assuming 
that the price will go down by 2% at the end of the day. If the market mid-price -black line- is above 
(resp. below) the optimal ask quote -dark blue- (resp. the optimal bid quote -light green-) then the 
market-maker sells (resp. buys) at market price, which we assume to coincide with the mid-price. 

• The mid-price starts at s = 1.00, it goes up to s = 1.03 at t = 0.1 and stays above s — 1.01 up 
to t = 0.2. Since the bet is that the price will converge down to fj, — 0.98, the market-maker sells 
the asset at market price. At t = 0.2 the market-maker has an inventory of q = —170 and a PNL 
below the martingale benchmark: as she sold her assets at the market price (mid-price), she paid 
the spread to mount her directional bet. 

• During the time interval (0.20, 0.50) the market-maker buys back her position via limit-orders. In 
order to favour the arrival of selling orders and deter buying orders she plays very aggressive bid 
quotes and very conservative ask quotes, hence the mid-price is closer to her bid quote than her 
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ask quotes. The strategy paid well because at i = 0.5 the mid-price converged to ^ = 0.98, her 
inventory went back to zero and her PNL outperformed the martingale benchmark. 

• On (0.50, 1.00) the market-maker does not make any directional bet, she only plays the bid-ask 
spread because the mid-price oscillates around fi = 0.98. As it can be seen, her bid and ask quotes 
are rather symmetric with respect to the mid-price, i.e. during all this non-directional period 
the strategy makes the same PNL than the martingale benchmark because both lines are almost 
parallel. 
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Figure 4. Simulation for fi — 1.00. 

In Figure 4 the market-maker assumes that the price will oscillate around fi — 1.00. Therefore, 
her quotes are symmetric near this threshold and are tilted when the prices wander far from it (i.e. she 
makes mean-reverting bets). Therefore, her inventory oscillates from positive on (0.00,0.2) to negative 
on (0.25,0.45), then back to positive and negative again. On (0.85,1.00) there is a huge drop in the 
mid-price, which in absence of inventory risk would imply a consequent positive inventory due to the 
mean-reverting dynamic. However, the inventory-risk-aversion forces her to avoid a directional bet at the 
end of the day, and as such her strategy is similar to the martingale case. It is worth to mention that 
the mid-price finished the day below the target of 1.00. In consequence, the market-making strategy is 
insensitive to peaks at the end of the day. 
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Figure 5. Simulation for /i = 1.02. 



In Figure 5 the market-maker assumes that the price wiU converge to fj, = 1.02. On (0.00,0.10) 
she mounts an inventory of g = 50 because she bets the mid-price wiU hit fi = 1.02. She ehminates 
her inventory during (0.20, 0.40) using hmit-orders with generous ask prices and not-very competitive 
bid prices, which translates into a greater flow of buying orders than selling orders. At t = 0.20 we can 
see that her directional bet has beaten the martingale benchmark. On (0.40,0.70) the mid-price rises 
from n = 1.02 and comes back, and since the market-maker is betting for a mean-reverting dynamic 
she builds up an inventory of q = —90 a,t t — 0.55. On (0.85, 1.00) the mid-price falls, but instead of 
making a U-turn in her inventory and turn it positive, as a mean-reverting dynamic suggests, she rather 
eliminates slowly her negative inventory by tilting her quotes towards the buying side. By doing so the 
market-maker avoids directional bets and minimises her market impact at the end of the trading day. 



7 Statistics of the PNL distributions 

Comparing linear and exponential utility strategies 

We performed 100,000 simulations with t] £ {0, 0.0001, 0.001}, 7 = 1 and e {0.98, 1.00, 1.02}. The rest 
of the parameters have the same values as for the previous simulations. 
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kurtosis 
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3.043 


6.176 




Jarque Bera 


30947.9 


16970.7 


19.1 


58949.6 




VaR 5% 


9.430 


-10.909 


10.086 


10.177 




VaR 1% 


8.346 


-25.726 


9.842 


9.465 


Inv 


mean 


0.087 


-333.297 


0.004 


-3.311 




std dev 


33.258 


418.200 


7.672 


15.227 




skewness 


-0.005 


0.622 


0.003 


0.007 




kurtosis 


2.981 


2.346 


3.001 


3.010 




Jarque Bera 


1.8 


8232.0 


0.2 


1.3 




Q(T) 90% 


[-55,55] 


[-847,463] 


[-13,13] 
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Table lA. Statistics of the PNL distribution, /j, = 0.98, rj = 0. ABM = arithmetic Brownian Motion 
without drift, MR = mean-reverting. The "Sharpc" value is the normalised return per risk unit, i.e. 

mean / std dev. 







linear ABM 


linear MR 


exp ABM 


exp MR 


PNL 


mean 


10.982 


11.576 


10.607 


10.945 




std dev 


0.412 


1.541 


0.347 


0.444 




"Sharpe" 


26.66 


7.51 


30.57 


24.65 




skewness 


0.023 


-2.164 


0.038 


-0.605 




kurtosis 


3.016 


11.418 


3.007 


4.460 




Jarque Bera 


10.2 


373263.9 


24.7 


14982.6 




VaR 5% 


10.308 


8.640 


10.041 


10.183 




VaR 1% 


10.026 


5.794 


9.809 


9.691 


Inv 


mean 


-0.020 


-1.739 


0.008 


-0.785 




std dev 


5.025 


8.428 


4.574 


5.696 




skewness 


0.006 


0.004 


-0.001 


-0.003 




kurtosis 


3.024 


2.995 


2.965 


3.000 




Jarque Bera 


3.0 


0.4 


5.2 


0.2 




Q(T) 90% 


[-8,8] 


[-16,12] 


[-8,8] 


[-10,9] 



Table IB. Statistics of the PNL distribution, fi = 0.98, rj = 0.0001. 
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VaR 5% 
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VaR 1% 
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9.252 


9.426 


Inv 


mean 
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std dev 
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Table IC. Statistics of the PNL distribution, /x = 0.98, r] = 0.001. 







linear ABM 


linear MR 


exp ABM 


exp MR 


PNL 


mean 


11.030 


13.457 


10.670 


11.039 




std dev 


0.992 


11.708 


0.356 


0.501 




"Sharpe" 


11.12 


1.15 


29.97 


22.03 




skewness 


0.014 


-0.676 


0.022 


-0.864 




kurtosis 


5.666 


5.176 


2.996 


5.632 




Jarque Bera 


29607.8 


27355.7 


8.4 


41315.1 




VaR 5% 


9.460 


-8.180 


10.086 


10.167 




VaR 1% 


8.368 


-22.205 


9.850 


9.524 


Inv 


mean 


-0.021 


0.055 


-0.013 


0.031 




std dev 


33.126 


468.213 


7.682 


15.245 




skewness 


-0.009 


0.000 


0.007 


0.007 




kurtosis 


2.990 


1.849 


3.024 


2.988 




Jarque Bera 


1.8 


5523.2 


3.2 


1.5 




Q(T) 90% 


[-55,55] 


[-732,731] 


[-13,13] 


[-25,25] 



Table 2A. Statistics of the PNL distribution, n = 1.00, ?? = 0. 
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0.5 
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Q(T) 90% 


[-8,8] 


[-14,14] 


[-7,8] 


[-9,9] 



Table 2B. Statistics of the PNL distribution, /x = 1.00, r] = 0.0001. 







linear ABM 


linear MR 


exp ABM 


exp MR 


PNL 


mean 


10.436 


10.483 


10.000 


10.226 




std dev 


0.335 


0.356 


0.325 


0.338 




"Sharpe" 


31.15 


29.45 


30.77 


30.25 




skewness 


0.031 


-0.097 


0.031 


-0.023 




kurtosis 


3.001 


3.162 


2.988 


3.069 




Jarque Bera 


15.5 


266.4 


16.5 


28.6 




VaR 5% 


9.888 


9.889 


9.469 


9.668 




VaR 1% 


9.658 


9.623 


9.246 


9.431 


Inv 


mean 


-0.011 


-0.009 


0.005 


0.006 




std dev 


1.673 


1.684 


1.672 


1.676 




skewness 


0.006 


-0.004 


-0.001 


0.002 




kurtosis 


3.026 


2.992 


3.025 


3.026 




Jarque Bera 


3.6 


0.5 


2.7 


3.0 




Q(T) 90% 


[-3,3] 


[-3,3] 


[-3,3] 


[-3,3] 



Table 2C. Statistics of the PNL distribution, jj, = 1.00, r] = 0.001. 
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Table 3A. Statistics of the PNL distribution, = 1.02, r] = 0. 







linear ABM 


linear MR 


exp ABM 


exp MR 


PNL 


mean 


10.982 


11.581 


10.605 


10.945 




std dev 


0.412 


1.517 


0.345 


0.446 




"Sharpe" 


26.66 


7.63 


30.74 


24.54 




skewness 


0.031 


-2.074 


0.028 


-0.647 




kurtosis 


3.052 


10.611 


3.011 


4.642 




Jarque Bera 


27.2 


313076.3 


13.9 


18207.8 




VaR 5% 


10.308 


8.659 


10.039 


10.182 




VaR 1% 


10.026 


5.977 


9.807 


9.662 


Inv 


mean 


-0.003 


1.712 


0.005 


0.795 




std dev 


5.024 


8.420 


4.574 


5.681 




skewness 


0.001 


-0.005 


0.010 


0.008 




kurtosis 


3.006 


2.976 


2.993 


2.980 




Jarque Bera 


0.2 


2.8 


1.9 


2.8 




Q(T) 90% 


[-8,8] 


[-12,16] 


[-7,8] 


[-9,10] 



Table 3B. Statistics of the PNL distribution, /x = 1.02, ry = 0.0001. 
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Table 3C. Statistics of the PNL distribution. ^ = 1.02, rj = 0.001. 



In Tables 1A-3C we can observe the effect of rj on the distribution of PNL and inventory: 

• The "Sharpe" ratio (that is mean over standard deviation) increased dramatically as rj increases 
for the linear utility models: the linear ABM has its "Sharpe" increased by a factor of 3 whilst 
the linear MR has a factor of 26. The same effect is seen for exponential MR with a factor of 1.5. 
However, for the exponential ABM there is not a significant change. 

• The inventory risk has an impressive reduction as t; increases; by a factor of 18 for linear ABM, 
230 for linear MR, 4 for exponential ABM and 8 for exponential MR. 

• The PNL mean decreases as 77 increases: -5% for linear ABM, -25% for linear MR, -6% for expo- 
nential ABM and -7% for exponential MR. 

• None of the the PNL distributions are not normal, but in all cases the inventory distributions can 
be considered as Gaussian because of their low Jarque-Bera scores. 



In consequence, the total effect of rj in the linear models is remarkable: it not only reduces the inventory 
risk, as expected, but as a welcoming side effect it also reduces the risks on the PNL distribution. Of 
course, nothing is free and this control on the risks comes with a reduction of the PNL mean. However, 
with the current framework a market-maker has enough room and tools to improve her PNL, given her 
risk budgets on inventory and PNL distribution. 



The effect of 77 on the linear-utility strategy 

As we have already seen, 77 has some indirect control on the the PNL distribution, but we wanted to have 
a more detailed view of this fact. We performed 20,000 Monte-Carlo simulations for n = 1.00. The rest 
of the parameters are as before. 
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Figure 6. Histogram of PNL as function of rj for linear utility function (/i = 1.00). Blue: Linear 
martingale. Red: Linear MR. Upper-Left: rj = 0, Lower-Left: rj = 0.0001, Upper- Right: rj = 0.0004, 

Lower-Right: t] = 0.001. 
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9.001 


10.115 


9.882 


VaR 1% 


8.306 


-22.297 
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Table 5. Statistics of the PNL distribution for linear MR as a function of rj. 



As we can see in Figure 6 and Table 5, between the martingale strategy and the linear MR strategy 
with 77 — 0.001 we trade 5% of our PNL to obtain a better Sharpe of a factor of 2.7, reduce our VaR(5%) 
by 8%, our VaR(l%) by 15% and our inventory risk by 95%. 

The effect of 7 on the exponential-utihty strategy 

We wanted to see if 7 can control directly the PNL distribution, as it is expected from an exponential 
utility function due to its variance-reduction features. In order to assess the effect of 7 separately from 
77, we performed 20,000 Monte-Carlo simulations for rj = and /i = 1.00. The rest of the parameters are 
as before. 
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Figure 7. Histogram of the PNL as a function of 7 for an exponential utility function (/i = 1.00). Blue: 
Linear martingale. Red: Exponential MR. Upper-Left: 7 = 0.1, Lower-Left: 7 — 0.3, Upper-right: 

7 = 0.7, Lower-Right: 7 = 1.0. 
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7 = 0.3 


7 = 0.7 


7 = 1 


mean 


11.023 


12.609 


11.581 


11.193 
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std dev 
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0.493 
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6.225 
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10.186 
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0.658 
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9.572 
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[-55,55] 


[-205,209] 
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[-34,34] 


[-25,25] 



Table 6. Statistics of the PNL distribution for exponential MR as a function of 7 {rj = 0). 

As we can see in Figure 7 and Table 6, between the linear martingale strategy and the exponen- 
tial MR strategy with 7 = 1 we have a better Sharpe of a factor of 2, reduce our VaR(5%) by 8%, our 
VaR(l%) by 14% and our inventory risk by 50%. In consequence, 7 controls directly the PNL distribution 
and also has indirect control on the inventory, but the effect of rj seems to be stronger in both risk factors. 

Observe that the indirect control of 7 on the inventory is less impressive than the indirect control of 
77 on the PNL distribution. However, with 7 the market-maker is not sacrificing any PNL at alln unlike 
she does with rj. 
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Comparing t] = 0.0001 and 7 = 0.3 

From Figures 6-7 we have that the hnear MR with 77 ~ 0.0001 and the exponential MR with 7 = 0.3 seem 
to have the same (statistical) mode than the bechmark linear martingale. Here we compare these two 
distributions. We performed 20,000 Monte-Carlo simulations for /x = 1.00 and kept the other parameters 
unchanged. 




Figure 8. Comparing the PNL histograms for linear MR {rj = 0.0001) and exponential MR (7 = 0.3). 

Blue: Linear MR. Red: Exponential MR. 
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-7.86 


skewness 


-1.832 
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8.95 
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9.305 


9.001 


3.27 
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7.281 
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[-72,74] 


[-14,14] 


-80.8 



Table 7. Statistical comparison between linear MR r] — 0.001 and exponential 7 = 0.3. The % change is 
the variation of the linear MR with respect to the exponential MR. Skewness change is in absolute value. 

As we can see in Table 7, seems that the exponential MR has better control on the PNL distribution 
than the linear MR: higher mean, "Sharpe" and quantiles at 1% and 5% (VaR); smaller standard devi- 
ation, (absolute) skewness and kurtosis. However, the inventory of exponential MR is much higher than 
the inventory of linear MR (5 times bigger). 
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8 Conclusions 

On the market-making model 

• In Theorem |4] we generalised the AveUaneda and Stoikov 1 approach for an exponential utility to 
any Markov process, provided its conditional expectation Et,s[S'(T)] is affine in s, which includes 
processes like arithmetic Brownian motion with drift or Ornstein-Uhlenbeck process. This allowed 
us to assess the effect of directional bets on the market mid-price on the PNL distribution of a 
high-frequency market-maker. Moreover, we also showed that the results of Lehalle et al [5] can 
be generalised to arithmetic Brownian motions with time-dependent drift and volatility, and that 
their approach cannot be extended further with the ansatz they chose. 

• If instead of an exponential utility function we choose a linear utility function with inventory 
constraints, in Theorems [l][3] we showed that using the same stochastic-control approach we can 
find the optimal controls (i.e. the market-maker bid and ask quotes) for very general Markov 
processes, even with jumps (e.g. Levy processes), provided the inventory-risk penalty 7r(s) is chosen 
appropriately to ensure the boundedness of the value function u{t, s, q, x). Moreover, the optimal 
controls in the linear case are independent of the volatility of the asset, which is very hard to 
estimate for high-frequency data, and as such the linear case is easier to calibrate with real data 
than the exponential case. 

• Our approach, although based on optimal-control and nonlinear-PDE techniques, is very intuitive: 
choose the right ansatz for the solution, compute the (implicit) controls, plug them into the equation, 
separate the equation into smaller and easier parts and solve them all to have the explicit form of 
the solution and the controls. Moreover, if the full equation is not explicitly solvable, approximate 
the jump part due to inventory: the resulting control is thus optimal for a sub-solution, and as 
such we are controlling the PNL utility function from below, which translates into optimal quotes 
to reduce losses. 

On the role of the parameters t] and 7 

• If 77 increases then the inventory risk decreases and eventually becomes negligeable. This was 
expected because the parameter rj was added as an inventory penalty. However, as a nice side effect 
we have that rj also reduces the risk on the PNL, in the sense that the variance and the kurtosis 
of the PNL distribution decrease as 77 increases. In other words, 77 controls perfectly the inventory 
risk, and by doing so it indirectly controls the PNL distribution. 

• For 7 we have the a similar effect, but the other way around. As 7 increases the risk on the PNL 
distribution decrease, in particular the first four moments and the VaR. Moreover, as a side effect 
the inventory risk decreases as well. This implies that 7 directly controls the PNL distribution and 
indirectly controls the inventory risk. 

• This mirror-like, intertwined role of 77 and 7 highlights a intimate relation between inventory risk 
and the risk on the PNL distribution: if one decreases then necessarily the other has to do the 
same. This implies that a market-maker who chooses to reduce the risk on her PNL distribution 
necessarily reduces her inventory risk; conversely, a market-maker who reduces her inventory risk 
also reduces the risk on her PNL distribution. However, this risk reduction also implies a reduction 
on the average PNL. 
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• It seems that the control provided by rj on the hnear MR is better than the control provided by 
7 on the exponential MR. However, this claim has to be taken with a grain of salt because, for 
r] — 0.0001 and 7 = 0.3 (where qualitatively both distributions are comparable to the martingale 
benchmark) the linear MR is less risky in terms of inventory whilst the exponential MR is less risky 
in terms of PNL distribution. That said, our numerical simulations show that an increase on rj 
(from 0.0001 to 0.001) renders a more drastic reduction on inventory risk and stronger structural 
changes on PNL distribution than an increase on 7 of the same order of magnitude (from 0.1 to 1). 

On the directional bets and the risk profiles 

As we have shown in our simulations, there is a clear (and expected) relation between risk and reward. 

• When the market-maker makes a directional bet she improves her PNL up to 25% with respect to 
the martingale benchmark. However, by doing so she has to accept more risk, either on her PNL 
distribution (measured in terms of variance, skewness and kurtosis) or on her inventory. 

• After a directional bet, the market maker can trade some of her excess PNL (over the martingale 
benchmark) for a direct control on her risks. If she chooses to gain direct control on her PNL 
distribution directly (resp. inventory risk) then she gains some reduction on her inventory risk 
(resp. PNL distribution), but the latter cannot be controlled directly. In that spirit, the market- 
maker can choose to use an exponential MR strategy if she prices more dearly her risk on the PNL 
distribution and a linear MR strategy if her biggest concern is the inventory risk. 

• In summary, directional bets enhance the PNL of the market-maker but add extra risk on her 
inventory and PNL distribution; both risks are positively correlated, so if she reduces one directly 
the other is indirectly reduced as well. Therefore, a market-maker in practice has to assess three 
factors, i.e. PNL increase, inventory risk and PNL distribution, in order to choose her optimal 
trading strategy because, in the current set-up at least, there is no strategy that is optimal in all 
three factors. 

Further developments 

• In our model there are no market orders, just limit orders. For the simulations we assumed that 
if (5+ < (resp. < 0) then the market-maker sends a selling (resp. buying) market order that 
is executed at the mid-price, i.e. we assumed that the market spread is zero and their impact (or 
cost) on the PNL is \S^ \ > 0. 

• For a detailed market-impact analysis, we should add the market spread as another state variable 
and consider that the market-maker's limit orders affect both the market spread and the mid-price. 
For example, if she improves the best ask (resp. bid) then she reduces the market spread by one 
tick and pushes down (resp. up) the mid-price by half a tick. 

• Another possibility is to incorporate market orders directly into the model and making a clear 
distinction between market and limit orders (see e.g. Guilbaud and Pham [7]). However, the 
current framework does not seem to be easily extended for that purpose, which suggests that a new 
framework is needed. We are currently working on that direction. 
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• We have assumed that the mid-price is continuous, which can be interpreted in the discretisation 
for our simulations as assuming that the bid-ask spread of the market-maker is large with respect to 
small changes on the mid-price. This is true for assets whose spread is large with respect to the tick 
size (e.g. equities), but not for futures for which the spread is in average 1-2 ticks, so the smallest 
price move can make the market-maker's quotes cross the spread. Therefore, another framework 
is necessary to deal with assets whose spread and tick size are comparable. We will address this 
problem in a future work. 

• Our mid-price S{t) is a Markov processes, just like in Avellaneda and Stoikov [T] and Lehalle et 
al. This means that the infinitesimal generator is local, S{t) has zero auto-correlation and the 
Hamilton- Jacobi-Bellnian equation is a nonlinear PDE. Now, if we consider a mid-price process 
with non-zero autocorrelation (e.g. fractional Brownian Motion, multifractal processes or Hawkes 
process) then its infinitesimal operator is non-local and its HJB equation is an integro-differential 
equation. However, in this case we cannot invoke the Feynman-Kac formula, which means that our 
approach is no longer valid and another model is necessary (see e.g. Cartea and Jaimungai [3] and 
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